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Abstract 



This article continues study of the prcqucntial framework for evaluating a prob- 
ability forecaster. Testing the hypothesis that the sequence of forecasts issued 
by the forecaster is in agreement with the observed outcomes can be done using 
prequential notions of probability. It turns out that there are two natural no- 
tions of probability in the prequential framework: game-theoretic, whose idea 
goes back to von Mises and Ville, and measure-theoretic, whose idea goes back 
to Kolmogorov. The main result of this article is that, in the case of predicting 
binary outcomes, the two notions of probability in fact coincide on the analytic 
sets (in particular, on the Borel sets). 
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1 Background 



The prequcntial framework for evaluating probability forecasters was introduced 
by A. P. Dawid in [4] and [5]. Suppose two players, Forecaster and Reality, 
interact according to the following protocol. 

Binary prequential protocol 

FOR n = 1,2,...: 

Forecaster announces p n G [0,1]. 

Reality announces y n £ {0, 1}. 
END FOR. 

The interpretation is that p n is Forecaster's subjective probability that y n = 
1 after having observed j/i,. .. ,y n -i an d taking account of all other relevant 
information available at the time of issuing the forecast. We will refer to p n 
as forecasts and to y n as outcomes. More generally, the outcomes take values 
in an arbitrary measurable space and the forecasts are probability distributions 
on that measurable space, but in this article we will restrict our attention to 
binary outcomes (as in [5]); this will be further discussed at the end of Section 
7. 

In general, the two players possess perfect information about each other's 
moves: Forecaster chooses pi, Reality observes pi and chooses y\, Forecaster 
observes y\ and chooses p\, etc. We might, however, be interested in "oblivi- 
ous" strategies for a player, especially for Reality, who may generate her moves 
randomly according to a probability measure on {0, 1}°° chosen in advance. On 
the other hand, the players may also react to events outside the protocol. 

Dawid's prequential principle (see, e.g., [4, 5, 6]) says that when testing the 
adequacy of the forecaster in light of the outcomes y n we should only use the 
forecasts p n , not the forecasting strategy (if any) that Forecaster used to produce 
p n . In this article we will be only interested in testing procedures that respect 
the prequential principle. In other words, we will be interested in testing the 
sequence 



of forecast /outcome pairs (p n , y n ) for agreement. This sequence may be infinite 
or finite. 

There are two main ways to test sequences (1) for agreement, which we 
will call game-theoretic and measure-theoretic. For concreteness, suppose the 
sequence (1) does not satisfy 



(i.e., the sequence is not "unbiased in the large"; see, e.g., [6] for numerous 
other ways of testing probability forecasts). What do we mean when we say 
that violation of (2) evidences lack of agreement? 



(Pl,2/l,P2,2/2, • ■ •) 
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Two ways to answer this question correspond to two different approaches 
to the foundations of probability theory. One version of the game-theoretic 
answer is that we can gamble against the forecasts is such a way that, risking 
only one monetary unit, we can become infinitely rich when (2) is violated. 
The measure-theoretic answer is that, no matter what strategy Forecaster is 
using, the probability of (2) is one; therefore, if (2) is violated, an a priori 
specified event of probability zero (given by the negation of the formula (2)) has 
occurred. Both becoming infinitely rich and the occurrence of a pre-spccificd 
event of probability zero can be interpreted as lack of agreement between the 
forecasts and outcomes. 

In fact, even the first answer can be expressed in terms of probability. The 
game-theoretic approach to the foundations of probability is as old as the stan- 
dard measure-theoretic based on Kolmogorov's axioms ([11]; see [17] for the 
historical background). An imperfect version of the game-theoretic approach 
was championed by von Mises [14] and formalized, in different ways, by Wald 
[24] and Church [2] . Ville [20] gave an example demonstrating that von Mises's 
notion of a gambling strategy was too restrictive, and introduced a more gen- 
eral class of gambling strategies and a closely related notion of a martingale. 
However, the formal notion of game-theoretic probability was introduced only 
recently (see, e.g., [21], [6], or, for a much fuller treatment, [16]). In particular, 
an event has zero game-theoretic probability if and only if there is a gambling 
strategy that, risking at most one monetary unit, makes the player infinitely 
rich when the event happens. 

The notion of game-theoretic probability makes the game-theoretic and 
measure-theoretic justifications of the testing procedure based on (2) look very 
similar: we just say that the probability (cither game-theoretic or measure- 
theoretic) of (2) being violated is zero. The main result of this article says 
that the two notions of probability coincide on the analytic sets, and so the 
two approaches to testing probability forecasts are equivalent, in the prequen- 
tial framework. The restriction to the analytic, and even Borel, sets is not a 
limitation in all practically interesting cases. 

For testing procedures based on events of probability zero (basically, on 
strong laws of probability theory, such as (2)), a special case of our result is 
sufficient: it is sufficient to know that a Borel set has zero game-theoretic prob- 
ability if and only if it has zero measure-theoretic probability. Our full result is 
also applicable to events of merely low, not zero, probability. For example, we 
could reject the hypothesis of agreement if 



for prespecified large numbers C and n. Our result shows that this and similar 
procedures have equally strong game-thcorctic and measure-theoretic justifica- 
tions. Notice that in the case of (3) our decision to reject the hypothesis of 
agreement can be made after observing a finite sequence, (pi, yi, . . . ,p„, y n ). 




(3) 
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2 This article 



In the following two sections, 3 and 4, we formally introduce in the prequential 
framework the two notions of probability discussed in the previous section. The 
main result of this article is Theorem 1 in Section 5, asserting the coincidence 
of the two kinds of probability on the analytic sets. This result has several 
predecessors. In the situation where Forecaster's strategy is fixed, Villc ([20], 
Theorems 1 and 2 in Chapter IV) showed that a set E has game-theoretic prob- 
ability zero if and only if it has measure-theoretic probability zero. (Ville stated 
this result in slightly different terms, without explicit use of game-theoretic prob- 
ability.) This was generalized in [16] (Proposition 8.13) to the statement that 
game-theoretic and measure-theoretic probability coincide on the Borel sets. In 
the case of a finite-horizon protocol, a statement analogous to Theorem 1 was 
proved by Shafer in [15] (Proposition 12.7.4). The special case of Theorem 1 
asserting the coincidence of game-theoretic and measure-theoretic probability 
on the open sets was first proved in [23] (Theorem 2). 

In the same Section 5 we also prove that measure-theoretic probability never 
exceeds game-theoretic probability. This simple statement is true for all sets, 
not just analytic. The proof of the opposite inequality is given in Section 6. 
It relies on two fundamental results: Choquet's capacitability theorem [1] and 
Levy's zero-one law in its game-theoretic version recently found in [22]. 

Some notation and definitions 

The set of all natural (i.e., positive integer) numbers is denoted N, N := 
{1, 2, . . .}. As always, R is the set of all real numbers. 

Let ft :~ {0, 1}°° be the set of all infinite binary sequences and fP := 
U£° =o {0, 1}" be the set of all finite binary sequences. Set II := ([0, 1] x {0, 1})°° 
and IP := U£L ([0, 1] x {0, 1})™. The empty element (sequence of length zero) 
of both fP and IP will be denoted A. In our applications, the elements of fl 
and O will be sequences of outcomes (infinite or finite), and the elements of II 
and IP will be sequences of forecasts and outcomes (infinite or finite). The set 
II will sometimes be referred to as the prequential space. 

For x G let T(x) C O be the set of all infinite extensions of x that belong 
to f2. Similarly, for x G 14°, T(x) C II is the set of all infinite extensions of x 
that belong to 14. For each uj = (2/1,2/2, • ■ •) G ^ and n G N U {0}, set uj n := 
(j/i, . . . ,2/n)- Similarly, for each tt = (pi, 2/1,^2, 2/2, • ■ •) G 14 and n G NU {0}, set 
7T™ := . . . ,p n ,y n ). 

In some proofs and remarks we will be using the following notation, for 
n G N U {0}: Q n := {0, 1}™ is the set of all finite binary sequences of length 
n; fl- n :— U°^„1P is the set of all finite binary sequences of length at least n; 
14" := ([0, 1] x {0, 1})"; 14^" := Ug„([0, 1] x {0, 1})\ 
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3 Game- theoretic prequential probability 

A farthingale is a function V : 11° — > (—00,00] satisfying 
V(pi,2/i, . . . ,p n -i,y n -i) 

= (1 -Pn)V(pi,J/l, . . . ,p n -l,y„-l,p n ,0) 

+ p n V(p 1 ,yi, . . . ,p n -i,y n -i,p n ,l) (4) 

for all n e N and all (pi, yi,p2, yi, ■ ■ •) € II; the product Ooo is defined to be 
0. If we replace "=" by ">" in (4), we get the definition of a superfarthingale. 
These are prequential versions of the standard notions of martingale and su- 
permartingale. We will be interested mainly in non-negative farthingales and 
supcrfarthingales . 

The value of a farthingale can be interpreted as the capital of a gambler 
betting according to the odds announced by Forecaster. In the case of super- 
farthingales, the gambler is allowed to throw away part of his capital. 

Game-theoretic probability can be introduced as either upper or lower prob- 
ability; in this article the former is more convenient (and was used in the infor- 
mal discussion of Section 1). A prequential event is a subset of II. The upper 
game-theoretic probability of a prequential event E is 

P gamc (S) := inf jo I 3V : V(A) = a and Vtt e E : limsup V{ir n ) > 1 j , (5) 

where V ranges over the non- negative farthingales. It is clear that we will obtain 
the same notion of upper game-theoretic probability if we replace the > in (5) 
by >, replace limsup by sup or liminf (we can always stop when 1 is reached), 
or allow V to range over the non-negative superfarthingales. 

We will need the following property of countable subadditivity of game- 
theoretic probability. 

Lemma 1. For any sequence E\, E2, ■ . ■ of prequential events, 

00 

pgamc (yj^Ei) < ^ P gam °(£ 4 ). 

i=l 

In particular, ifW axae (E i ) = for all i, then P^ 1110 ^^) = 0. 

Proof. It suffices to notice that the sum of a sequence of non-negative farthin- 
gales is again a non-negative farthingale. □ 

4 Measure-theoretic prequential probability 

A forecasting system is a function <fi : — > [0,1]. Let $ be the set of all 
forecasting systems. For each <fr G $ there exists a unique probability mea- 
sure P^ on n (equipped with the Borel cr-algebra) such that, for each x £ f2°, 
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P^,(r(xl)) = 4>{x) P<^(r(a;)). (In other words, such that </>(x) is a version of the 
conditional probability, according to P^, that x will be followed by 1.) The no- 
tion of a forecasting system is close to that of a probability measure on 17: the 
correspondence t— > P^ becomes an isomorphism if we only consider forecasting 
systems taking values in the open interval (0, 1) and probability measures taking 
positive values on the sets T(x), x e 0°. 

For each sequence (2/1, . . . , y n ) G and each forecasting system e let 

(y 1 , . . . ,y„) := (0(A), 2/1, 0(?/i), y 2 , . . . , 0(2/i, ■ ■ ■ , y n -i), Vn) € n°. 
Similarly, for each (?/i, y 2 , ■ ■ ■) € and each 

(yi,2/2, • ■ -) : = (</»(A),yi, 0(2/1), 2/2,0(2/1,2/2), 2/3, ■ ••) e n. 

We can apply the idea of measure-theoretic probability to prcquential events 
as follows, in the spirit of [9], Section 10.2. For each forecasting system and 
prequential event iiCII, define 

F+(E) := P {lo e n I ^ e E) = P (^), 

where := {uj e fi | oj^ € i^} and P<^(^4) is understood, in general, as the outer 
measure of A, i.e., as infaP^i?), B ranging over the Borel sets containing A. 
The convention about using the outer measure is important only for our proofs, 
not for the statement of the main result: according to Luzin's theorem (see, 
e.g., [10], Theorem 21.10), every analytic set is universally measurable, and E& 
is analytic whenever E is. Now we define the upper measure-theoretic probability 
of E as 

P moas (£) := su P P*(S). (6) 

Remark 1. Our definition (6) is not fully adequate from the intuitive point of 
view: even if we are willing to assume that Forecaster follows some forecasting 
strategy (which is a non-trivial assumption: cf. the discussion in [5], pp. 1255- 
1256), why should this forecasting strategy depend only on the past outcomes? 
For example, a meteorologist forecasting rain might have data about tempera- 
tures, winds, etc. (See [5], Section 9, for further discussion.) A more satisfactory 
definition would involve a supremum over all probability spaces equipped with 
a filtration and for each such a probability space a further supremum over all 
forecasting systems adapted to the corresponding filtration (with a natural more 
general definition of a forecasting system). Our definition (6) is the simplest one 
mathematically and leads to the strongest inequality F game (E) < ¥ meas (E) (for 
the analytic sets), which is the non-trivial part of our main result, Theorem 1. 

5 Main result 

Now we are have all ingredients needed to state our main result. 
Theorem 1. For all analytic sets ECU, Ps amo (^) = P moas (^). 
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Intuitively, this theorem establishes the equivalence between the purely pre- 
quential and Bayesian viewpoints in the framework of probability forecasting. 
The definition of measure-theoretic probability is Bayesian, in that Forecaster 
is modeled as a coherent subjectivist Bayesian having a joint probability dis- 
tribution over the sequences of outcomes (cf. [3], Section 1); we represent this 
joint probability distribution as a forecasting system. Rejecting his forecasts 
is the same as rejecting all forecasting systems that could have produced those 
forecasts: cf. the sup^ in (6). The definition of game-theoretic probability is 
purely prequential, in that it does not postulate any joint probability distribu- 
tion behind the forecasts; the latter are used for testing directly. 

Remark 2. As discussed in the previous section (Remark 1), our Bayesian 
forecaster is somewhat naive: he conditions only on the observed outcomes. It 
would be easy (but would complicate the exposition) to allow Reality to issue 
a signal s n , taking one of a finite number of values, before Forecaster chooses 
his forecast p n . Allowing both farthingales and forecasting systems to depend 
on the signals, one could still prove that F s&mc (E) = p mcas (i;) for all analytic 
ECU following the proof of Theorem 1 . 

In this section we will only prove the inequality > in Theorem 1. It turns 
out that this inequality holds for all sets E, not necessarily analytic. 

Theorem 2. For all sets E C n, ps amo (£) > P moas (£;). 

The simple proof of Theorem 2 will follow from Ville's inequality ([20], p. 100; 
in modern probability textbooks this result is often included among "Doob's 
inequalities": see, e.g., [19], Theorem VII. 3.1. III). 

Let <j) be a forecasting system. A martingale w.r. to is a function V : £1° — > 
(—00,00] satisfying 



for all x G fi° (with the same convention Ooo := 0). 

Proposition 1 ([20]). If 4> is a forecasting system, V is a non-negative mar- 
tingale w.r. to (j), and C > 0, 



is a martingale w.r. to (p. It is important that this statement does not require 
measurability of the farthingale V; even if V is not measurable, is always 
measurable, like any other function on £1° (which is why there was no need to 
include the requirement of measurability in our definition of a martingale). 



V(x) = (1 - <f>(x))V(x, 0) + <f>(x)V(x, 1) 




If V is a farthingale, the function : £1° — > (—00, 00] defined by 
V^ix) := V (x*) , xeil , 
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Proof of Theorem 2. Let E C II. It suffices to prove that F^E*) < V(A) 
for any forecasting system and any non-negative farthingale V satisfying 
limsup n V(n n ) > 1 for all n g £\ Fix such and V". Then is a non-negative 
martingale w.r. to satisfying limsup„ V*(w") > 1 for all w g £"^. Applying 
Proposition 1 to we can see that indeed P (i^) < V*(A) = V(A). □ 



6 Proof of the inequality < in Theorem 1 

We start from proving a special case of Theorem 1. 

Lemma 2. If ECU is a compact set, ¥ meaa (E) = P^ mc (E). 

Proof. Fix a compact prequential event ECU. (Of course, "compact" is the 
same thing as "closed" in this context.) Represent E as the intersection E = 
(l^l 1 Ei of a nested sequence E\ D E 2 ^2 ■ ■ ■ of closed sets such that 

Vtt g n : 7T g Ei => T(ir l ) C ^ (7) 

is satisfied for all i. Informally, Ei is a property of the first i forecasts and 
outcomes. For each i = 1,2,..., define a superfarthingale Wj by setting 

I otherwise 

for all x g II-* and then proceeding inductively as follows. If W,(x) is already 
defined for x g II™, n = i, i — 1, . . . , 1, define Wj(ar), for each x g n™ _1 , by 

Wi(x):= sup ((l-p)W i (x,p,0)+pW i (x,p,l)). (9) 
P e[o,i] 

It is clear that Wi > W 2 > ■ ■ ■ . 

Let us check that Wi(x) is upper semicontinuous as a function of x g II . 
By (8) this is true for x g II- 4 . Suppose this is true for x g 11", n g — 
1,...,2}, and let us prove that it is true for x g II n_1 , using the inductive 
definition (9). It is clear that f(x,p) := (1 —p)Wi(x,p, 0) +pWi(x,p, 1) is upper 
semicontinuous as function of p e [0,1] and x e IF 1-1 . It is well known that 
sup p f(x,p) is upper semicontinuous whenever / is upper semicontinuous and x 
and p range over compact sets (see, e.g., [7], Theorem 1.2(d)). A simple proof 
of a slightly more general fact will be given below in Lemma 3. Therefore, 
Wi(x) = sup pe r 0;1 ] f(x,p) is an upper semicontinuous function of x g IP -1 . 

An important implication of the upper semicontinuity of Wi and the com- 
pactness of [0, 1] is that the supremum in (9) is attained: it is easy to check that 
an upper semicontinuous function attains its supremum over a compact set (cf. 
[8], Problem 3.12.23(g)). For each i — 1,2,..., we can now define a forecasting 
system fa as follows. For each x g f2 n , n = 0, 1, ... ,i — 1, choose <pi{x) such 
that 
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(1 - ^(x^Wiix^ , <f>i(x), 0) + faWWiix** , 4>i{x), 1) 

= sup((l - p)Wi (x^ ,p,0)+ P W t {x* ,p,l))=Wi {x* ) 
p 

(this is an inductive definition; in particular, is already defined at the time 
of defining <j>i(x)). For x G O- 1 , set, for example, <pi(x) := 0. The important 
property of <pi is that Wf' is a martingale w.r. to (pi, and so P^ 4 (.Ej) = Wi(A). 

Since the set $ of all forecasting systems is compact in the product topol- 
ogy, the sequence <pi has a convergent subsequence <pi k , k = 1,2,...; let 
<p := linifc^oo <p ik . We assume, without loss of generality, i\ < i 2 < ■ ■ ■ ■ Set 

c:=infV^(A) = lim W t (A). 

i i — >oo 

Fix an arbitrarily small e > 0. Let us prove that F^(E^) > c — e. Let K G 
N. The restriction of P^ ifc to Q tK (more formally, the probability measure 
assigning weight P^ (r(a;)) to each singleton {x}, x G il lK ) comes within e of 
the restriction of P^ to f2 lK in total variation distance from some k on; let the 
total variation distance be at most e for all k > K' > K. Let k > K 1 . Since 
P 0lfc (E^ k ) > c, it is also true that P 0ifc {E^ k ) > c; therefore, it is true that 

P ( p(E i ^ k ) > c — e. By Fatou's lemma, we now obtain 

P (lim sup E^ k j > lim sup P^E?^ ) > c - e. (10) 

\ k J k^oc 

Let us check that 

lim sup CEf K . (11) 
fe 

Indeed, let ui £ Ef K , i.e., uj^ £ E iK . Since <p ik — ► <p m the product topology and 
the set Ei K is closed, uA* G' Sj K from some fc on. This means that G 
for only finitely many k, i.e., ui ^ limsup fc i?^* . 

From (10) and (11) we can see that P^(^) - c ~ e ' for a11 ^ ^ N. This 
implies P^ ) > c - e. Since this holds for all e, P (£ ) > c. 

The rest of the proof is easy: since 

pgamc^ <C<P c/> (E' t >) < P mcas (£) < pS amo (£) 

(the last inequality following from Theorem 2), we have 

pgamc (E) = C = F ( p(E' t >) = P m ° aS (E). □ 

In the proof of Lemma 2 we referred to the following simple result. 

Lemma 3. Suppose X and Y are topological spaces and Y is compact. If a 
function f : X x Y — > M. is upper semicontinuous, then the function x G X i— » 
g(x) :— sup y£ y f(x, y) is also upper semicontinuous. 



8 



Proof. For any c G M, we are required to show that the set G :— {x \ 
sup y f(x, y) < c} is open. Let x £ G. For any y £7 there exists a neighborhood 
0' y of x and a neighborhood Oy of y such that, for some e > 0, f(x', y') < c — e 
for all x' £ 0' y and all y' £ Oy. By the compactness of Y, there is a finite family 
Oy ± , . . . , Oy K that covers Y. The intersection of 0' yi , . . . , 0' VK will contain x 
and will be a subset of G. Therefore, G is indeed open. 

The argument in [7], proof of Theorem 1.2(d), is even simpler, but it assumes 
that X is compact (which is, however, sufficient for the purpose of Lemma 
2). □ 

The idea of the proof of Theorem 1 is to extend Lemma 2 to the analytic 
sets using Choquet's capacitability theorem (stated below). Remember that a 
function 7 (such as ps ame r p mcas ) mapping the power set of a topological space 
X (such as II) to [0, 00) is a capacity if: 



for any subsets A and B of X, 



A£B=> 7(A) < 7(5); 



(12) 



for any nested increasing sequence A\ C A2 C • • • of arbitrary subsets of 
X, 

7(U£i^)= hm 7 (A 4 ); (13) 



for any nested decreasing sequence ifi D K2 2 • • • of compact sets in X, 
7 (n£ 1 tf i )= lim 7 (^). (14) 



Condition (14) is sometimes replaced by a different condition which is equivalent 
to (14) for compact metrizable spaces X: cf. [10], Definition 30.1. 

It turns out that both ps amc and P mcas are capacities. We start from ps amc . 

Theorem 3. The set function P« ame is a capacity. 

It is obvious that ps ame satisfies condition (12). The following two statements 
establish conditions (13) and (14). Condition (14) is easier to check: it can be 
extracted from the proof of Lemma 2. 

Lemma 4. If K\ D K2 2 • • • is a nested sequence of compact sets in IT, 

pgamo ( n ~ 1 .) = l im pS ame (^). (15) 

Proof. We will use the equality ps am °(^) = lim woo ps amc (£ 4 ), in the notation 
of the proof of Lemma 2. This equality follows from 

P gamo (£0 = c = lim W l (X) > lim ps amc (^) 
(the opposite inequality is obvious). 
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Represent each K n in the form K n = D°l 1 Ei where E\ D E 2 2 • • • and each 
Ei satisfies (7); we will write K n ^ in place of Ei. Without loss of generality we 
will assume that Ki^ D K 2 .i 5 • • • for all i. Then the set K := fl^-Kj can be 
represented as K = n^iQ^, and so (15) follows from 

P gamc (if ) = ps amc (n£ 1 Jf j , i ) = lim ¥ same (K iti ) = lim lim P samc (K n ^) 

= lim P gamo {n^Knt) = lim P gamo (X„). □ 

To check condition (13) for ps amc ; we will need the game-theoretic version, 
proved in [22], of Levy's zero-one law ([13], Section 41). For each x G IP, define 
the conditional upper game-theoretic probability of E C II by 

P gamo (£ I x) := 

inf I a I 3V : V{x)=a&nd Vtt eEnT(x) : limsupP(7r") > l| , 

where V ranges over the non-negative (super)farthingales. 
Proposition 2 ([22]). Let ECU. For almost all ir e E, 

pgame^ | _» ! (1 6 ) 

as n — > 00. (7n other words, there exists a prequential event N such that 

pgame^) = Q md ( 16 ) hMs f Qr a llw€E\N.) 

Proof. It suffices to construct a non-negative farthingale V starting from 1 that 
tends to 00 on the sequences tt e E for which (16) is not true. Without loss of 
generality we replace "for which (16) is not true" by 

liminfP gamo (£; I tt") < a, 

n^oo 

where a € (0, 1) is a given rational number (see Lemma 1). 

Let 7T be any sequence in II; we will define V(ir n ) by induction for n = 1, 2, . . . 
(intuitively, we will describe a gambling strategy with capital process V) . Start 
with 1 monetary unit: V(A) := 1. Keep setting P(7r") := 1, n — 1,2, . . ., until 
pg ame( E | ^n) < a ( if this never happens, P(tt") will be 1 for all n). Let N x 
be the first n when this happens: ps amo (£; | tt^ 1 ) < a but F^ mc (E | tt") > a 
for all n < N\. Choose a non-negative farthingale Si starting at ir Nl from 1, 
Si(tt Ni ) = 1, whose upper limit exceeds 1/a on all extensions of tt Ni in _E. 
Keep setting V(n n ) := Si(n n ), n = Ni, Ni + 1, . . ., until 5i(7r n ) reaches a value 
si > 1/a. After that keep setting P(7T n ) := P^"- 1 ) until ps amc (£ | tt") < a. 
Let iV 2 be the first n when this happens. Choose a non-negative farthingale S2 
starting at ir N2 from si, S2 (t" 2 ) = Si, whose upper limit exceeds Si/a on all 
extensions of n N2 in £. Keep setting P(7T n ) := S 2 (ir n ), n = N 2 ,N 2 + 1, . . ., 
until 6*2 (7T n ) reaches a value s 2 > si(l/a) > (1/a) 2 . After that keep setting 
P(tt") := P^"" 1 ) until ps amc (S | tt") < a. Let N 3 be the first n when this 
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happens. Choose a non-negative farthingale S3 starting at ir Ns from s 2 whose 
upper limit exceeds sija on all extensions of n N3 in E. Keep setting V(w n ) := 
Sz{ir n ), n — N 3 ,N 3 + 1, . . ., until S3 reaches a value s 3 > s 2 (l/a) > (1/a) 3 . 
And so on. □ 

Lemma 5. If A\ C A2 C • • • C II is a nested sequence of prequential events, 

pgamc Qj^Ai) = lim P Same (^)- (17) 

i — >oo 

Proof. Let A\, A2, . . . be a nested increasing sequence of prequential events. The 
non- trivial inequality in (17) is <. For each Ai the process 

Si(x) := PS amo (A 4 I x) 

is a non-negative superfarthingale (see Lemma 6 below). By Proposition 2, 
limsup„ Si(ir n ) > 1 for almost all ir £ Ai. The sequence Si is increasing, Si < 
S2 < • • • , so the limit S := lim^oo Si = sup^ Si exists and is a non-negative 
superfarthingale such that S(A) = lim woo Ps amo (A t ) and limsup„ S(tt") > 1 for 
almost all it GLiiAi (by Lemma 1). We can get rid of "almost" by adding to S a 
non-negative farthingale V that starts at V(A) < e, for an arbitrarily small e > 

0, and satisfies limsup„ V(Tr n ) > 1 for all 7r e UjAj violating limsup„ S(n n ) > 

1. □ 

Lemma 6. For any prequential event E, the function x e H° 1— > ps amc (i<; | x) 
is a superfarthingale. 

Proof. Suppose there are x € IT and p e [0, 1] such that 

P gamo (£ I x) < (l-p)P slimc (E I x,p,0)+pP gamo (S I s,p,l). 

Then there exists a non-negative farthingale V with limsup„ V(n n ) > 1 for all 
7r e Bnr(i) that satisfies 

V(a;) < (1 - P )P S!imc (E I x,p,0)+ P P 6amc (£ | jc,p,l) 

and, therefore, 

{l-p)V(x,p,0)+pV(x,p,l) < (1 - p)P^ mc (E \ x, Pl 0) + P P^ mc (E I x,p,l). 

The last inequality implies that there exists j € {0,1} such that l^(x,p, j) < 
pgamc^ 1 x ,p,j), which is impossible. □ 

This completes the proof of Theorem 3. Let us now check that measure- 
theoretic probability is also a capacity. 

Lemma 7. The set function P mcas is a capacity. 
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Proof. Property (12) is obvious for f> meas . Property (14) follows from Lemmas 
2 and 4. 

Let us now check the remaining property (13), with p mcas as 7. Suppose 
there exists an increasing sequence A\ C A 2 C • • • C X of prequential events 
such that 

P moa "(u£ 1 A i ) > lim P mcas (A,). 
Let <f> be a forecasting system satisfying 

P^U^A,) > lim P moas (v4i). 

Then will satisfy P 1 * (Ug^i) > lim^^ P^(^), which is equivalent to the 
obviously wrong P (u^Af ) > lim;^ P*(Af ). □ 

In combination with Choquet's capacitability theorem, Theorem 3 and 
Lemma 7 allow us to finish the proof of Theorem 1 . 

Choquet's Capacitability Theorem ([1]). If X is a compact metrizable 
space, 7 is a capacity on X, and E C X is an analytic set, 

j(E) — sup{j(K) I K is compact, K C E} . 

For a proof of Choquet's theorem, see, e.g., [10], Theorem 30.13. 

Proof of Theorem 1. Combining Choquet's capacitability theorem (applied to 
the compact metrizable space n), Lemma 2, Theorem 3, and Lemma 7, we 
obtain 

pgame ^ = gup pgame (JQ = sup pmeas ^ = pmeas ^ 
KCE KCE 

K ranging over the compact sets. □ 

Remark 3. The fact that game-theoretic probability and measure-theoretic 
probability are capacities has allowed us to prove their coincidence on the ana- 
lytic sets, and it might be useful for other purposes as well. In general, neither 
of these capacities is strongly subadditive, in the sense of satisfying 

l{A \JB)+ >y(A n B) < 7(A) + 7(B) 

for all prequential events A and B. To demonstrate this it suffices, in view of 
Theorem 1, to find analytic sets A and B that violate 

pgame ^ [j B) + P gam0 (A n B) < P gam0 (A) + P gamo (B). (18) 

We can define P game (£") for subsets of II™ by (5) with limsup„ omitted. This is 
an example of subsets A and B of II 2 for which (18) is violated: 

0,0,i,oV fi,0,0,oH, (19) 



2' I ' V 2 
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(20) 



For these subsets we have 

pgamc^ y + pgame^ n ^) = l + I > I + I = pS*™^) + PS am0 (B). 

To obtain an example of subsets A and B of the full prequential space II for 
which (18) is violated, it suffices to add 00 . . . at the end of each element of the 
sets A and B defined by (19) and (20). 

7 Application to the limit theorems of probabil- 
ity theory 

The lower game-theoretic probability of a prequential event E is defined to be 1 — 
P same (Il\_E). Similarly, the lower measure-theoretic probability of a prequential 
event E is defined to be 1 - p mcas (n \ E). 

The game-theoretic strong law of large numbers (see, e.g., [16], Section 3.3) 
implies that (2) holds with lower game-theoretic probability one. The stan- 
dard martingale strong law of large numbers implies that (2) holds with lower 
measure-theoretic probability one. Our Theorem 1 establishes the equivalence 
between these two statements. Similarly, Theorem 1 establishes the equivalence 
between the game-theoretic law of the iterated logarithm for binary outcomes 
(a special case of Theorems 5.1 and 5.2 in [16]) and the martingale law of the 
iterated logarithm for binary outcomes in measure-theoretic probability theory. 

Transition from game-theoretic to measure-theoretic laws of probability, cor- 
responding to the inequality > in Theorem 1, depends only on Ville's inequality, 
and so can be easily done for a wide variety of prediction protocols (see, e.g., 
[16], Section 8.1). Transition in the opposite direction, corresponding to the 
inequality <, is more difficult, and its feasibility has been demonstrated only in 
a very limited number of cases. 

In an important respect Theorem 1 is only an existence result. For example, 
in combination with the standard martingale strong law of large numbers in 
measure-theoretic probability theory it implies the game-theoretic strong law of 
large numbers for binary outcomes, but the resulting farthingale is very complex. 
The corresponding strategy for the gambler (or Skeptic, in the terminology of 
[16]) is also very complex. This contrasts with the simple and efficient gambling 
strategies designed in game-theoretic probability: see, e.g., [16], Section 3.2, and 
[12]. 

It would be interesting to design efficient general procedures producing sim- 
ple gambling strategies witnessing that P gamc (_E) = for natural classes of pre- 
quential events satisfying p meas (£:) = o. For example, such a procedure might 
be applicable to all prequential events satisfying p moas (£;) — o and situated at 
a given low level of the Borel hierarchy. This would allow an automatic proce- 
dure of transition from measure-theoretic to constructive game-theoretic laws 
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of probability: e.g., the set of sequences (1) violating the strong law of large 
numbers (2) is in the class £3 of the Borel hierarchy, and the set of sequences 
violating the law of the iterated logarithm is in A°. 

In this article we have only considered the case where the outcomes y n are 
restricted to the binary outcome space Y := {0,1}. It is easy to extend our 
results to the case where Y is any finite set and Forecaster outputs probability 
measures on Y, interpreted as his probability forecasts for y n . It remains an 
open problem whether it is possible to modify our definitions in a natural way 
so that the equivalence between game-theoretic and measure-theoretic proba- 
bility extends to a wide classes of outcome spaces and prcquential events; this 
would require imposing suitable measurability or topological conditions on the 
farthingales (or superfarthingalcs) used in the definition (5) of game-theoretic 
probability. 
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